SkyDB: Skyline Aware Query Evaluation Framework
Abstract
In recent years much attention has been focused on evaluating skylines; however, existing techniques primarily target skyline algorithms over single sets. These techniques face two serious limitations, namely (1) they define skylines to work on a single set only, and (2) they treat skylines as an “add-on”, loosely integrated on top of the query plan. In this work, we investigate the evaluation of skylines over disparate sources via joins. We then propose SkyDB, a skyline-aware query evaluation framework that addresses four key issues that enable the treatment of skylines as a first-class citizen in query processing. First, we extend the relational model to include skyline-aware operators. Second, for these new operators we design execution strategies that are tuned to exploit the skyline knowledge. Third, we propose our skyline-aware query optimizer to effectively choose between the query plan execution strategies. In the literature, we observe that the evaluation of skylines over joins is considered to be blocking. Therefore, existing approaches focus only on reducing the skyline evaluation time, rendering them inapplicable for response-time-sensitive applications. Fourth, we thus aim to transform the execution of skylines over joins into a non-blocking operation so that SkyDB can produce results progressively. Our preliminary performance study demonstrates the superiority of our proposed methodologies over existing techniques by outperforming them in many cases by several orders of magnitude.
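To make the setting concrete, the following is a minimal sketch of the blocking "join first, then skyline" baseline that the abstract contrasts SkyDB against. It is not SkyDB's operators or execution strategies, which the abstract does not detail. The function names, the tuple layout `(join_key, attributes)`, the `combine` mapping, and the convention that smaller values are preferred are all assumptions made for illustration.

```python
from itertools import product


def dominates(a, b):
    """Return True if a is no worse than b in every dimension and strictly
    better in at least one (assumption: smaller values are preferred)."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def skyline(points):
    """Naive quadratic skyline: keep the points that no other point dominates."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


def skyline_over_join(left, right, combine):
    """Blocking baseline: materialise the full equi-join, then filter.

    left, right: lists of (join_key, attrs) pairs (hypothetical layout).
    combine: maps a pair of attribute tuples to one output tuple.
    No result can be emitted until the whole join has been computed.
    """
    joined = [
        combine(l_attrs, r_attrs)
        for (l_key, l_attrs), (r_key, r_attrs) in product(left, right)
        if l_key == r_key
    ]
    return skyline(joined)
```

For example, joining flights `(city, (price, duration))` with hotels `(city, (price, distance))` on city and using `combine = lambda f, h: (f[0] + h[0], f[1], h[1])` yields the packages that are not dominated on total price, flight duration, and hotel distance. Because every join result must exist before the dominance filter runs, this baseline produces no output until the very end, which is the blocking behaviour the abstract describes.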
Similar resources
Skyline and Mapping Aware Query Evaluation Across Disparate Data Sources
Growing interest in multi-criteria decision support applications has resulted in a flurry of efficient skyline algorithms. In practice, real-world decision support applications need to access data from disparate sources. Existing techniques define the skyline operation to work on a single set, and therefore, treat skylines as an “add-on” on top of a traditional Select-Project-Join query pl...
Caching Dynamic Skyline Queries
Given a query tuple q, the dynamic skyline query retrieves the tuples that are not dynamically dominated by any other tuple in the data set with respect to q. A tuple dynamically dominates another, w.r.t. q, if its values are at least as close to q’s in all attributes and strictly closer to q’s in at least one. The dynamic skyline query can be treated as a standard skyline query, subject to the transf...
Simultaneous Processing of Multi-Skyline Queries with MapReduce
With the rapid increase in the number of applications as well as the size of data, multi-query processing on the MapReduce framework has gained much attention. Meanwhile, there has been much interest in skyline query processing due to its power in multi-criteria decision making and analysis. Recently, there have been attempts to optimize multi-query processing in MapReduce. However, they are not ...
Link-based Ranking of Skyline Result Sets
Skyline query processing has received considerable attention in the recent past. The skyline query is mainly used to find the set of non-dominated data points in a multi-dimensional dataset. One of the major drawbacks of the skyline operator is the high cardinality of the result set. Providing the most interesting points of the skyline set (top-k) inherently involves the ranking of the skyline p...
AGiDS: A Grid-Based Strategy for Distributed Skyline Query Processing
Skyline queries help users make intelligent decisions over complex data, where different and often conflicting criteria are considered. A challenging problem is to support skyline queries in distributed environments, where data is scattered over independent sources. The query response time of skyline processing over distributed data depends on the amount of transferred data and the query proces...
Publication date: 2009